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GLOBAL MOTION COMPENSATION FOR VIDEO PICTURES 

Cross Reference to Related Application 

This application for patent claims the benefit of priority from, and hereby incorporates by 
reference the entire disclosure of, co-pending U.S. provisional application for patent serial 
number 60/334,979, filed November 30, 2001. 

BACKGROUND OF THE INVENTION 

The invention disclosed and claimed herein generally pertains to a method for 
compression of video signal data. More particularly, the invention pertains to a method of the 
above type which employs global motion compensation. Even more particularly, some 
embodiments of the invention pertains to a method of the above type wherein macroblocks are 
grouped into slices and global motion compensation information is transmitted with encoded 
slices. 

It is anticipated that embodiments of the invention could be used in connection with 
television decoders of standard (SDTV) and high (HDTV) definition digital TV signals, as a part 
of video conferencing systems, and in computers including PCs, laptops and the like for 
decoding video. Embodiments could also be used in mobile devices such as mobile phones and 
PDAs, as a part of a decoder in a digital cinema projector, and in video recorders, players and 
home entertainment systems. However, it is not intended to limit the invention to such 
embodiments. 

Digital video signals, in non-compressed form, typically contain large amounts of data. 
However, the actual necessary information content is considerably smaller due to high temporal 
and spatial correlations. Accordingly, video compression, or coding, is used to reduce the 
amount of data which is actually required for certain tasks, such as storage of the video signals or 
for transmitting them from one location to another. In the coding process temporal redundancy 
can be used by making so-called motion-compensated predictions, where regions of a video 
frame are predicted from similar regions of a previous frame. That is, there may be parts of a 
frame that contain little or no change from corresponding parts of the previous frame. Such 
regions can thus be skipped or non-coded, in order to maximize compression efficiency. On the 
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other hand, if a good match with a previous frame cannot be found, predictions within a frame 
can be used to reduce spatial redundancy. With a successful prediction scheme, the prediction 
error will be small and the amount of information that has tb be coded greatly reduced. 
Moreover, by transforming pixels to a frequency domain, e.g., by using the discrete cosine 
transform, spatial correlations provide further gains in efficiency. 

Herein, the terms "picture" and 'frame" are used interchangeably to refer to a frame of 
image data in a video sequence. 

High temporal correlations are characteristic of video. Hence, much effort in optimizing 
video compression is focused on making accurate temporal predictions of regions of a frame. 
The better the prediction, the less bits are needed to code the discrepancy. The prediction itself is 
coded as instructions on how to translate, or even scale or rotate, a previously coded region. If 
many regions of a frame have similar motion, such as in a pan or zoom, further improvements in 
compression efficiency can result from coding a global motion separately, which then applies to 
all or some regions of the frame. This technique is often referred to as global motion 
compensation (GMC). 

There are several reasons, however, why one should not address the whole frame when 
global motion compensation is used. The first reason is error resilience. In order to prevent error 
propagation from corrupted parts of an image, prediction is often constrained within bounded 
segments called slices. Each slice of a frame should therefore also be self-contained regarding 
global motion information. Another reason is that global motion compensation may not be 
relevant for an entire frame, even though smaller parts of the frame may benefit from global 
motion compensation applied to each part separately. 
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Still image coding versus motion compensation 

A typical video codec, such as ITU-T Recommendations H.261 and H.263, MPEG-1 part 
2, MPEG-2 part 2 (H.262), or MPEG-4 part 2 5 operates by sequentially encoding a video 
sequence frame by frame. A frame is further divided into blocks that are coded sequentially row 
by row, starting at the top left corner and ending at the bottom right corner. A typical block size 
is that of a macroblock (MB) covering 16x16 luminance pixels. 

The first frame in the sequence is encoded as a still image, called an intra frame. Such a 
frame is self-contained and does not depend on previously coded frames. However, they are not 
only used at the start of the sequence, but may also be advantageously used at instances where the 
video changes abruptly, such as scene cuts, or where it is desirable to have a random-access 
point, from which a decoder can start decoding without having to decode the previous part of the 
bitstream. The pixel values of intra-coded macroblocks are usually transformed to a frequency 
domain, e.g. using discrete cosine transform and the transform coefficients quantized in order to 
reduce the size of the resulting bitstream. 

In contrast, an inter frame is coded as a motion-compensated difference image relative to 
an earlier frame. By using an already decoded frame (reconstructed frame) as reference, the 
video coder can signal for each macroblock a set of motion vectors (MVs) and coefficients. The 
motion vectors (one or several depending on how the macroblock is partitioned) inform the 
decoder how to spatially translate the corresponding regions of the reference frame in order to 
make a prediction for the macroblock under consideration. This is referred to as motion 
compensation. The difference between the prediction and the original is encoded in terms of 
transform coefficients. However, not all macroblocks of an inter frame need to be motion 
compensated. If the change from the reference macroblock to the current macroblock is small, 
the macroblock can be coded in COPY mode, i.e. not coded per se but signaled to be copied. See 
section 5.3.1 of ITU-T Recommendation H.263 "Coded macroblock indication (COD) (1 bit)" 
for an example of a COPY mode implementation. On the other hand, if the macroblocks differ 
substantially, it may be better to code it as an intra macroblock. 

Global motion compensation in H.263 
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Instead of addressing the motion compensation on a block basis only, it might be 
advantageous to extract the global motion of a frame separately and code the deviations from the 
global motion for each block. In a passing or zooming sequence, or when a large object moves 
over the frame, the overall motion information is likely to be kept at a minimum by such a 
scheme. A well-known technique is to add an additional step in the coding process before an 
inter frame is coded. Annex P "Reference Picture Resampling" of H.263 provides a method for 
"warping" the reference picture given four displacement vectors v 00 , v° v , and v specifying 
the displacements of the corner pixels of a frame. Figure 1 shows a reference frame, with these 
vectors respectively extending from the corner pixels 8. The displacements of all other pixels are 
given by a bilinear interpolation of these vectors, that is: 



r 4- 



r y + 



r 



9 Eqn.(l) 



where (x, y) is the initial location of a pixel, H and V represent locations of the corner pixels in 
the reference frame, and 



r°= v 00 



r? = v oy -v 00 

For a detailed description of implementation of these formulae, reference may be made to 
Recommendation H.263. When this global motion compensation is used for a subsequent or 
inter frame, the reference frame is resampled, pixel-by-pixel, using the above interpolation. 
After the resampling has been performed, the coder can continue with coding the inter frame, 
based on the resampled reference frame. 

Global Motion Compensation in MPEG-4 part 2 

Global motion compensation is also specified in the MPEG-4 visual standard using so- 
called S(GMC)-VOPs. Here the global motion compensation is applied pixel-by-pixel as for 
H.263 Annex P. However, one can still choose on a macroblock level whether the (interpolated) 
global motion compensated reference frame should be used or not. 
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Global Motion Compensation Proposed for K26L 

ITU-T is currently developing a new video-coding standard, Recommendation H.26L, 
which is also likely to be jointly published as an International Standard by ISO/IEC called 
MPEG-4 AVC (ISO/IEC 14496-10). The current H.26L standard follows the above-mentioned 
general video coding design with frames and macroblocks, where each picture is encoded by a 
picture header followed by macroblocks. This standard is discussed further hereinafter, in 
connection with Figures 8-9. 

A significant disadvantage in using global motion compensation for a video frame is the 
loss of error resilience and flexibility caused by addressing entire frames. Thus, if the global 
motion vectors are coded only once for a picture, e.g. at the beginning of the picture, and this part 
of the bitstream is lost during transmission, the whole picture is likely to be corrupted. 
Accordingly, motion vectors for blocks throughout the picture cannot be decoded and must be 
concealed. Such errors may also propagate in time, since the next picture can be an inter picture 
as well, thus using a corrupted picture as reference. Another problem, specifically regarding 
proposed global motion vector coding (GMVC) design for the H.26L standard, is that different 
global motions for parts of a frame cannot be specified. 

SUMMARY OF THE INVENTION 

The present invention addresses the above problem of using global motion compensation 
for frames by applying global motion compensation to frames which have been segmented into 
slices. Each slice is treated as a self-contained unit that does not use motion vectors from 
macroblocks outside itself for prediction. By coding global motion vectors for each slice, all 
motion vectors within the slice can be constructed or decoded in a self-contained manner. 
Moreover, the global motion vectors can be applied differently to each slice, or can be used to 
repeat global motion vectors referencing the entire frame. By signaling global motion 
compensation for each slice, the advantages of global motion compensation can be realized 
without neglecting error resilience. In addition, compression efficiency can be improved, since 
the invention enables global motion to be better fitted for smaller regions. 

As a further benefit, the invention introduces a new mode for global motion 
compensation, referred to as Implicit Global Motion Compensation (IGMC). This mode can be 
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used as a tool in many coding scenarios which require a more useful MB mode than COPY 
mode, in order to minimize the total number of bits needed for motion vectors. In COPY mode 
the motion vector is always zero. In contrast, IGMC uses implicit motion vectors that are 
predicted. 

The invention can be directed to a method of video data compression for use with image 
blocks derived by dividing a video frame into a sequence of blocks. In a preferred embodiment, 
the blocks are macroblocks comprising 16x16 (luminance) pixels and where one of several 
possible macroblock-coding modes is an implicit global motion compensation (IGMC) mode. 
This mode is used to copy pixels from a previous frame of a collocated block, dislocated by a 
motion vector that is predicted from neighboring image blocks of the current frame. 

Another part of the invention can be directed to a method of video data compression for 
use with slices derived by segmenting a video frame into a sequence of slices, wherein each slice 
comprises a plurality of image blocks. The method comprises the steps of encoding the data of 
respective slices to generate corresponding encoded slices, applying a signal to each encoded 
slice to indicate whether global motion compensation (GMC) is to be used in reconstructing its 
corresponding original slice, and if so, including GMC information with the encoded slice. 
Preferably, motion vectors for each image block of an encoded slice can be reconstructed using 
only GMC information contained in the slice. In one embodiment, at least two of the encoded 
slices contain the same GMC information. In another embodiment, at least two of the encoded 
slices contain different GMC information. Usefully, GMC information contained by respective 
slices of the frame comprises global motion vectors respectively referenced to the video frame. 

In a preferred embodiment, the GMC information contained in a given encoded slice 
comprises or represents a set of encoded global motion (GM) vectors. The set of GM vectors 
may be used together with bilinear interpolation to calculate the global motion compensation for 
each pixel contained in the given encoded slice. Alternatively, the GM vectors may be used with 
bilinear interpolation to calculate global motion compensation for each block( in an array of 4x4 
pixel blocks comprising the given encoded slice. 

In yet another embodiment, a particular encoded slice has a macroblock copy mode that 
signals encoding of the image blocks of the particular encoded slice by copying respectively 
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corresponding image blocks located in a global motion compensated reference frame derived 
from the video frame. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic diagram illustrating global motion compensation of a frame in 
accordance with the H.263 video compression standard. 

Figure 2 is a schematic diagram showing a simplified system for compressing, 
transmitting and decoding video information in accordance with embodiments of the invention. 

Figure 3 is a block diagram showing certain components for the compressor of the system 
shown in Figure 2. 

Figure 4 is a schematic diagram illustrating the partitioning of a frame from a video 
sequence into slices respectively comprising macroblocks. 

Figure 5 is a schematic diagram illustrating motion vectors associated with respective 
pixel blocks of a macroblock. 

Figures 6-7 are schematic diagrams, each showing a slice comprising a sequence of 
macroblocks for illustrating embodiments of the invention. 

Figure 8 is a schematic diagram representing the bitstream syntax of the H.26L design on 
picture and macroblock levels. 

Figure 9 is a schematic diagram representing proposed global motion compensation in 
H.26L on picture and macroblock levels. 

Figure 10 is a schematic diagram showing a slice comprising 4x4 macroblocks for 
illustrating embodiments of the invention pertaining to the H.26L video compression standard. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Referring to Figure 2, there is shown a source 10 of video information such as a video 
camera. The information, comprising a succession of video frames, is coupled to a video coder 
or compressor 12, which compresses successive frames of data in accordance with an 
embodiment of an invention, as described herein. A bit stream representing the compressed data 
is transmitted through a communication channel 22, which may be a wireless communication 
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channel, from a transmitter 14 to a receiver 16. The received data is applied to a decoder 18 to 
recover the video information. 

Referring to Figure 3, there are shown certain conventional components of a compressor 
12 for processing a 16x16 pixel macroblock 20, derived by dividing a frame in a sequence of 
video frames such as is shown in Figure 4. The components shown in Figure 3 include a 
transform module, such as a Discrete Fourier Transform module 24, a quantizer 26 and a binary 
encoder 28. 

As is known in the art, transform module 24 receives an array of integers, comprising 
respective gray scale levels (luminance) and color levels (chrominance) of the pixels of 
macroblock 20. Module 24 applies the transform to the pixel levels to generate an output array 
of transform coefficients. As is likewise well known, quantizer 26 divides each transform 
coefficient by a corresponding step size or quantization level. The output of quantizer 26 is 
directed to binary encoder 28, which generates a corresponding stream of digital bits 30 for 
transmission through channel 22. 

Referring now to Figure 4, there is shown a frame 32 comprising one of the frames in a 
video sequence 34. Figure 4 further shows frame 32 segmented into a number of slices 36a-d, 
wherein each slice 36a-d comprises a sequence of macroblocks 38. Each macroblock comprises 
an array of pixels from frame 32. As described hereinafter in further detail, a slice 36 may 
contain GMC information pertaining to its macroblocks 38 and/or to frame 32, in accordance 
with embodiments of the invention. 

Figure 4 also shows slice 36a in further detail, to emphasize that a slice boundary can 
appear after any macroblock of a frame. Slice 36b is shown to include marcoblocks located in 
several rows 37 of the frame. Moreover, slice 36b starts a few macroblocks from the left frame 
boundary of the slice and ends a few macroblocks before the right frame boundary on the last 
row of the slice. Thus, some slices span over more than one row 37, and a row 37 can contain 
more than one slice, such as the bottom row containing slices 36c and 36d. 

Referring to Figure 5, the pixel blocks 40a-40d' of a previous frame are used for 
predicting the blocks 40a-40d of the current frame. The motion vectors 42a-42d are describing 
the location from where pixels from the first frame shall be copied to blocks 40a-40d of the 
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current frame. This illustrates that the pixel blocks 40a-40d can be readily determined or 
reconstructed by using a previously decoded frame together with the motion vectors 42a-42d. 

Referring to Figure 6, there is shown an encoded slice 40 generated by encoding a slice 
such as slice 36b shown in Figure 4. Encoded slice 40 comprises macroblocks or image blocks 
42 and is provided with a header 44. In one embodiment of the invention, a signal is applied to 
encoded slice 40, for example by including the signal in header 44, to indicate whether or not 
global motion compensation (GMC) is to be used in reconstructing the corresponding original 
slice 36. If GMC is to be used in slice construction, header 44 also includes GMC information. 
Such GMC information usefully comprises or represents motion vector information from which 
the motion vectors for each of the blocks 42 of slice 40 can be reconstructed at the decoder. 
Thus, all motion vectors in slice 40 can be derived exclusively from information contained in 
slice 40. 

In another useful embodiment, all of the slices 40 encoded from respective slices 36 of 
video frame 32, referred to above, contain the same GMC information. For example, the GMC 
information could comprise an encoded set of the GM vectors r°, t? 3 r v } and described above 
in connection with Figure 1 . These vectors are referenced to the entire warped video frame, as 
shown in Figure 1, and are derived from the corner pixel displacement vectors shown therein 
according to relationships described above. Resiliency is significantly enhanced by repeating this 
information in each encoded slice, since the information would reach the decoder even if some of 
the slices were lost in the transmission channel. The encoded GM vectors could be used with 
bilinear interpolation, in accordance with Equation (1) set forth above, to calculate the global 
motion compensation for each pixel contained in the slice 40. 

In a further embodiment, each encoded slice 40 would have GMC information 
comprising the global motion vectors r°> €> f> and 2^ wherein the vectors are referenced to the 
bounding box of the slice rather than to the entire frame. Referring further to Figure 6, there is 
shown bounding box 46 of slice 40 comprising the smallest rectangle that can contain the slice. 
Figure 6 shows corner pixels 46a-d at respective corners of bounding box 46, with vectors v 0 ^ 
v H0 , v 0k f and v Hy in this case specifying displacement of the respective comer pixels of the 
bounding box, rather than of the entire frame. The four vectors r°, r*, f y and r xy can be 
determined from the corner pixel vectors by means of the same relationships set forth above in 
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connection with H.263 for comer pixel displacement of an entire frame. From the four vectors 
coded for the bounding 46, the global motion compensation for each pixel contained in the slice 
40 may be readily calculated using bilinear interpolation, in accordance with Equation (1). It will 
be readily apparent that in this embodiment, different encoded slices 40 will contain different 
GMC information. 

Referring to Figure 7, there is again shown encoded slice 40 with bounding box 46. 
However, only two global motion vectors 48a and 48b are shown, which are encoded as the 
GMC information for slice 40. These vectors refer to the left-most and right-most pixels, 
respectively, of bounding box 46. Global motion compensation for respective pixels of slice 40 
can be determined therefrom. For pixels along a vertical axis, the global motion vectors are the 
same, whereas along a horizontal axis they are interpolated linearly from the two coded global- 
motion vectors 48a and 48b. 

In a further embodiment, all of the GMC information in a slice can be repeated on the 
picture or frame level, such as in the header of the frame 32. 

Referring to Figure 8, there is shown the bitstream syntax of the H.26L standard on 
picture and macroblock levels. In H.26L the macroblocks of an inter frame have one of several 
modes, which currently include 7 inter modes (16x16, 16x8, 8x16, 8x8, 8x4, 4x8 and 4x4), 1 
intra mode for 4x4 and 23 intra modes for 16x16. Here NxM refers to the size of the blocks the 
macroblock is partitioned into. For inter MBs, each block has an MV and for intra MBs, each 
block is predicted as a unit. Furthermore, there is one COPY mode, which uses no MV and no 
coefficients. This is the cheapest mode to signal. In fact, run-length coding is used to signal a 
number of copied (skipped) macroblocks with one codeword. 

A global-motion compensation has been proposed for the H.26L standard. It resembles 
the GMC used in Annex P of H.263 in the way the global motion vectors are defined. A main 
difference, however, is that the reference picture is not resampled and that the interpolated 
motion vectors do not apply to pixels but rather to blocks of pixels. The motion vector of an 
image block with its upper-left pixel as (x,y) can be derived as 



f * \ 
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where r°, r x , ?, and are related to v°° t y H0 , v ov } and v" K as for H.263 Annex P. However, these 
motion vectors apply to image blocks consisting of 4x4 pixels. In particular, the vectors . 
Y 00 , v H0 j v ov , and v HV apply to the corner blocks of the frame with their upper-left pixels at 
(0.0), (H-4.0), (0, V-4) and (H-4, V-4), respectively. 

Global Motion Vector Coding (GMVC) for H.26L is proposed to apply only for certain 
macroblock modes of the picture. Whether or not it is used is signaled for each inter frame in the 
picture header by a flag (GMVC flag). If GMVC is turned on the four GMVs r°, r x , f, and 
follow the flag. These are used for the current picture whenever a mode using GMVC is signaled 
in a macroblock. The proposed syntax is shown in Figure 9. 

The macroblock modes for macroblocks in a frame with GMVC enabled have two new 
modes. The COPY mode is replaced by GMVC_COPY and there is an additional mode called 
GMVCJ6. Both modes are inter 4x4 modes, i.e. the macroblock is partitioned into image 
blocks of 4x4 pixels. The motion vectors for each block is given by the interpolated GMVs as 
given by the above formula. For GMVC_COPY no coefficients are coded, i.e. the motion- 
compensated reference picture is copied, whereas for GMVC 16, coefficients are added as well. 

Referring to Figure 10, there is shown an encoded slice 50 comprising macroblocks 52, 
wherein each macroblock 52 is partitioned into 4x4 image blocks 54 in accordance with the 
H.26L standard. In Figure 1 0, interpolated GM vectors apply to 4x4 blocks 54 rather than to 
pixels. Referring further to Figure 10, there are shown corner blocks 54a-d having associated 
global motion vectors 56a-d, specifying their displacements. Corner blocks 54a-d define the 
corners of a rectangular bounding box 58 containing slice 50. The motion vectors 56a-d are 
represented by GMC information contained in header 60 of slice 50. From the information 
pertaining to GM vectors 56a-d, global motion compensation can be calculated for each of the 
4x4 blocks 54 of slice 50, by means of linear interpolation in accordance with Equation(2) above.' 

In a further embodiment, global motion compensation for each of the image blocks 54 
can be calculated from two encoded GM vectors (not shown) specifying displacement of the left- 
most and right-most 4x4 blocks contained within bounding box 58. In yet another embodiment, 
global motion compensation for each block 54 can be derived from a single encoded GM vector 
comprising the GMC information contained in header 60. 
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In another embodiment wherein a slice is GMC enabled, i.e. contains a signal indicating 
GMC information, the information comprises a COPY mode signal. In response to this signal in 
regard to a particular macroblock of the slice, the encoder will copy the corresponding 
macroblock from the global motion compensated reference picture, that is, from the warped 
frame as shown in Figure 1. In this embodiment coefficients pertaining to the macroblock may 
or may not be coded as well. 

In an embodiment referred to as Implicit global motion compensation (IGMC) no explicit 
motion vector is sent with the encoded slice 40. Instead, the macroblock COPY (aka SKIP) 
mode is reinterpreted as an Inter macroblock mode without coefficients or explicitly coded 
motion vectors. The motion vector used for motion compensating the macroblock is predicted 
from neighboring blocks. More specifically, in applying this embodiment to a particular image 
block in the current frame, a collocated block in a previous frame, dislocated by a motion vector 
is copied from the previous frame. The motion vector is predicted from neighboring blocks in the 
current frame, that is, from blocks which are adjacent to or proximate to the particular image 
block. The IGMC mode can be used to replace the COPY mode in coding an image block. 

As a farther feature of the IGMC embodiment, a bitstream representing a coded image 
block can include a syntactic element which may be interpreted to indicate either the IGMC 
mode or the COPY mode. A switch between the IGMC and COPY modes may be signalled 
implicitly, by means of other code elements. Alternatively, this switch may be signalled 
explicitly by a code word. 

In a modification of this embodiment, one extra motion vector is sent for each slice, to be 
used in predicting the first inter block of the slice. 

Obviously, many other modifications and variations of the present invention are possible 
in light of the above teachings. It is therefore to be understood that within the scope of the 
disclosed concept, the invention may be practiced otherwise than as has been specifically 
described. 
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What is claimed is: 

1 . In a method of video data compression for use with video frames 
comprising a plurality of image blocks, wherein each image block shall be decoded 

5 according to one of a plurality of coding modes, a method of decoding a particular 
image block in a current frame according to an implicit global motion 
compensation (IGMC) mode comprising the steps of: 

copying from a previous frame a collocated block dislocated by a motion 
vector; and 

10 predicting said motion vector from neighboring image blocks of said current 

frame. 

2. The method of Claim 1 wherein: 

one of said plurality of modes comprises a COPY mode used to copy a 
15 collocated block with zero displacement; and 

said IGMC mode is used to replace said COPY mode in decoding said 
particular image block. 

3. The method of Claim 2 wherein: 

20 a data bitstream representing said coded particular image block includes a 

syntactic element which is interpreted to indicate either said IGMC mode or said 
COPY mode. 

4. The method of Claim 3 wherein: 

25 a switch between said IGMC and COPY modes is signalled explicitly by a 

code word. 

5. The method of Claim 3 wherein: 

a switch between said IGMC and COPY modes is signalled implicitly by 
30 previously decoded code elements. 
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6. The method of Claim 1 wherein: 

said motion vector prediction is computed, for each vector component 
individually, as the median of three neighboring motion vectors. 

5 

7. A video decoder for decoding an image block in a current frame 
according to one of a plurality of coding modes, said video decoder comprising: 

means for copying from a previous frame a collocated bock dislocated by a 
motion vector according to an implicit global motion compensation (IGMC) mode; 
10 and 

means for predicting said motion vector from neighboring image blocks of 
said current frame according to said IGMC mode. 

8. A video decoder according to Claim 7 comprising: 

15 means for copying a collocated block with zero displacement according to a 

COPY mode; and 

means for replacing said COPY mode with said IGMC mode in decoding a 
particular image block. 

20 9. A video decoder according to Claim 8 comprising: 

means for switching between said COPY mode and said IGMC mode in 
decoding a particular image block. 

1 0. The method of Claim 1 wherein: 

25 an image block comprises 1 6x1 6 pixels having only one motion vector. 

1 1 . The method of Claim 1 wherein: 

an image block comprises 16x16 pixels partitioned into 4x4 pixel blocks 
having one motion vector each. 

30 
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12. The method of Claim 1 wherein: 

an image block comprises 16x16 pixels partitioned into a plurality of blocks 
having one motion vector each. 

5 13. The method of Claim 1 wherein: 

a frame is segmented into slices, wherein each slice comprises one or more 
image blocks. 

14. The method of Claim 13 wherein: 

10 an extra motion vector is sent with said particular slice to be used in 

determining the motion vector for the first block of said particular slice. 

15. A method of video data compression for use with slices derived by 
segmenting a video frame into a sequence of slices, wherein each slice comprises 

15 one or more image blocks, said method comprising the steps of : 

identifying a signal in each encoded slice that indicates whether global 
motion compensation (GMC) is to be used in reconstructing the slice; 

identifying GMC information in an encoded slice if GMC is to be used in 
reconstructing the slice; and 
20 decoding the data of respective encoded slices to generate corresponding 

decoded slices. 

1 6. The method of Claim 1 5 wherein: 

motion vectors for each image block of a particular encoded slice can be 
25 constructed using GMC information contained only in said particular encoded slice. 

1 7. The method of Claim 1 5 wherein: 

at least two of said encoded slices contain the same GMC information. 
30 18. The method of Claim 17 wherein: 
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said GMC information contained by said at least two encoded slices 
comprises global motion (GM) vectors referenced to said video frame. 

1 9. The method of Claim 1 5 wherein: 
5 at least two of said encoded slices contain different GMC information. 

20. The method of Claim 15 wherein: 

said method includes the step of parsing a frame header in which all GMC 
information contained in respective encoded slices is repeated. 

21 . The method of Claim 15 wherein: 

said GMC information contained in a given encoded slice comprises a set of 
encoded GM vectors; and 

said method includes the step of using said set of GM vectors, together with 
bilinear interpolation, to calculate the global motion compensation for each pixel 
contained in said given encoded slice. 

22. The method of Claim 21 wherein: 

said GM vectors of said set specify the displacements of respective corner 
20 pixels of the bounding box of said given slice. 

23. The method of Claim 21 wherein: 

said vector set comprises two coded GM vectors specifying displacement of 
the left-most and right-most pixels, respectively, of the bounding box of said given 
25 slice. 

24. The method of Claim 15 wherein: 

said GMC information contained in a given encoded slice comprises a set of 
encoded GM vectors; and 
30 said method includes using said set of GM vectors, together with bilinear 
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interpolation, to calculate global motion compensation for each block in an array of 
4x4 pixel blocks comprising said given encoded slice. 

25. The method of Claim 24 wherein: 
5 said GM vectors of said set specify the displacements of 4x4 blocks located 

at respective corners of the bounding box of said given slice. 



26. The method of Claim 24 wherein: 

10 said vector set comprises two encoded GM vectors specifying displacement 

of the left-most and right-most 4x4 blocks, respectively, of the bounding box of 
said given slice. 

27. The method of Claim 24 wherein: 

15 said vector set comprises a single encoded GM vector for use in calculating 

global motion compensation for respective 4x4 blocks of said given slice. 

28. The method of Claim 15 wherein: 

a particular encoded slice has a macroblock COPY mode that signals 
20 encoding of the image blocks of said particular encoded slice by copying 
respectively corresponding image blocks located in a global motion compensated 
reference frame derived from said video frame. 

29. The method of Claim 28 wherein: 

25 coefficients for use in predicting motion compensation are not encoded for 

respective image blocks of said particular encoded slice. 

30. The method of Claim 28 wherein: 

coefficients for use in predicting motion compensation are encoded for 
30 respective image blocks of said particular encoded slice. 



SUBSTITUTE SHEET (RULE 26) 



WO 03/047268 



1/4 



PCT/SE02/02206 




WO 03/047268 



2/4 



PCT/SE02/02206 








VIDEO 




XMTR 


COMPRESS 







-22 






^-18 


RCVR 




VIDEO 




DECODER 



FIG. 2 



20 



24 



16 x 16 






pixel 




Transform 


block 








30 



Output Bit Stream 



FIG. 3 



WO 03/047268 



3/4 



PCT/SE02/02206 



42b 



40c 



42c 




40a 


40b 






40c 


40d 







FIG. 5 



yOO 



46 



v ov 




46a 44 

c 




40 



HDR- 
GMC 
INFO 



MB 



MB 



MB 



MB 



MB 



MB 



MB 



MB 



42 



MB 



MB 



MB 



MB 



MB 



MB 



42 



46 



FIG. 6 



v HO 




46b 



MB 



MB 



V HV 



46d 



44 



40 



42 



46 — 




HDR- 
GMC 
INFO 


MB 


MB 


MB 


MB 


MB 


48a^ 


MB 


MB 


MB 


MB 


MB 


MB 




MB 


MB 


MB 


MB 


MB 













FIG. 


7 


^-46 






48b 



WO 03/047268 



4/4 



PCT/SE02/02206 



Picture 
Header 


MB 


MB 




MB 



MB 
Mode 



Inter MV (if not COPY) 
or Intra prediction 



Coefficients 
(if ony) 



FIG. 8 



Picture 


GMVC Rag 


GMVs 


MB 


MB 




MB 


Header 


(if INTER) 


(if GMVC on) 







MB 
Mode 
(GMVC) 



Inter MVs (if not GMVC modes) 



Coefficients 
(if any) 



FIG. 9 



56a 



52 ^-58 



60 











T 








INFO 










MB 


MB 


MB 










7 




— 4- 













56c^" 54c-" ^54 

50 



FIG. 10 



56b 



C 



54b 



54" 



54d 



56d 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
5 June 2003 (05.06.2003) 





PCT 



(10) International Publication Number 

WO 03/047268 A3 



(51) International Patent Classification 7 : 

7/36, 7/50 



H04N 7/26, 



(21) International Application Number: PCT/SE02/02206 

(22) International Filing Date: 

29 November 2002 (29.11 .2002) 



(25) Filing Language: 

(26) Publication Language: 



English 
English 



(30) Priority Data: 

60/334,979 
10/306,349 



30 November 2001 (30.1 1.2001) US 
27 November 2002 (27. 1 1 .2002) US 



< 



(71) Applicant: TELEFONAKTIEBOLAGET LM ERICS- 
SON (PUBL) [SE/SE]; S-Stockholm 126 25 (SE). 

(72) Inventors: SJOBERG, Rickard; Grondalsvagen 8, S-147 
30 Tumba (SE). EINARSSON, Torbjorn; Fridhemsgatan 
29 A, S-l 12 40 Stockholm (SE). FROJDH, Per; Tomtebo- 
gatan 21, S-l 13 39 Stockholm (SE). 

(74) Agent: Kajsa Boestad, et al.; Ericsson AB, Patent Unit 
Core Networks Kista, S-l 64 80 Stockholm (SE). 



(81) Designated States (national): AE, AG, AL, AM, AT, AU, 

AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NO, NZ, OM, PH, PL, PT, RO, RU, SD, SE, SG, 
SI, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, UZ, VN, 
YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT BE, BG, CH, CY, CZ, DE, DK, EE, 
ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE, SK, 
TR), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, 
GW, ML, MR, NE, SN, TD, TG). 

Published: 

with international search report 

(88) Date of publication of the international search report: 

9 October 2003 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations'* appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



(54) Title: GLOBAL MOTION COMPENSATION FOR VIDEO PICTURES 




O 



8— i 



(57) Abstract: A system and a method for coding and decoding video data are invented. In a system and method of video data 
compression a video frame (32) is divided into a sequence of image blocks (38), wherein one of several possible block-coding modes 
is an implicit global motion compensation (IGMC) mode, which is used to copy pixels from a previous frame (32) displaced by a 
predicted motion vector. In another embodiment of the invention, a system and method of a video data compression, a video frame 
(32) is segmented into a sequence of slices (36), wherein each slice (36) includes a number of macroblocks (38). Respective slices 
(36) are encoded and a signal is included in the header (44) of an encoded slice (40) to indicate whether the slice (40) is GMC 
enabled, that is, whether global motion compensation is to be used in reconstructing the encoded slice. If so, GMC information, such 
as information representing a set of motion vectors (42a-42d), is included with the slice. In a useful embodiment each slice (36) of 
a frame (32) contains the same GMC information, to enhance resiliency against errors. In another embodiment different slices (36) 
of a frame (32) contain different GMC information. In either embodiment, motion vectors (42a-42d) for each image of a particular 
encoded slice (40) can be reconstructed using GMC information contained only in the particular encoded slice. 



INTERNATIONAL SEARCH REPORT 



Intei | Application No 

PCT7SE 02/02206 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 7 H04N7/26 H04N7/36 H04N7/50 




According to International Patent Classification (IPC) or to both national classification and IPC 




B. FIELDS SEARCHED 


Minimum documentation searched (classification system followed by classification symbols) 

t nr "7 uf\A m 
IPC / HU4N 


Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 


Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 

EPO-Intemal, WPI Data, PAJ, INSPEC, COMPENDEX, IBM-TDB 


C. DOCUMENTS CONSIDERED TO BE RELEVANT 


Category 0 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 


WO 00 70879 A (HUI YAU WAI LUCAS ;0H HONG 

LYE (SG); STMICR0ELECTR0NICS ASIA PACIF) 

23 November 2000 (2000-11-23) 

page 7, line 7-19 

page 12, line 2-7 

abstract; claim 1; figures 2,6 


1-30 


A 


W0 98 52356 A (CHANG SHIH FU ;UNIV 
COLUMBIA (US); MENG JIANHA0 (US)) 
19 November 1998 (1998-11-19) 
page 2, line 6-9 
page 4 9 line 6-14 
page 22, line 1-5 
abstract; figure 6 


1-30 


A 


US 5 565 920 A (DICKINSON BRADLEY W ET 
AL) 15 October 1996 (1996-10-15) 
the whole document 


1-30 


j j Further documents are listed in the continuation of box C. 


|)( j Patent family members are listed in annex. 


° Special categories of cited documents : 

"A" document defining the general state of the art which is not 
considered to be of particular relevance 

"E" earlier document but published on or after the international 
filing date 

"L" document which ma/ throw doubts on priority claim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 


T" later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

"&" document member of the same patent family 


Date of the actual completion of the international search 


Date of mailing of the international search report 


5 March 2003 


3 T 03. 2003 


Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL-2280 HV Rijswijk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 


Authorized officer 

JESPER BERGSTRAND/JA A 



Form PCT/ISA/210 (second sheet) {July 1992) 



INTERNATIONAL SEARCH REPORT 



Into | Application No 

PCT/SE 02/02206 



Patent document 
cited in search report 



WO 0070879 



W0 9852356 



US 5565920 



Publication 
date 



Patent family 
member(s) 



A 



23-11-2000 



wo 

EP 



0070879 Al 
1181828 Al 



19-11-1998 



W0 9852356 Al 
JP 2001526859 T 



15-10-1996 



US 
AU 
CA 
EP 
JP 
W0 



5592226 A 
1729495 A 
2182005 Al 
0741945 Al 
9509024 T 
9520863 Al 



Publication 
date 



23-11-2000 
27-02-2002 



19-11-1998 
18-12-2001 



07-01-1997 
15-08-1995 
03-08-1995 
13-11-1996 
09-09-1997 
03-08-1995 



Form PCT/ISA/210 (patent {amity annex) (July 1992} 



